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Statement  of  the  problem  studied 


This  transdisciplinary  project  sought  to  develop  new  statistical  methods  and  theory  for 
multivariate  nonparametric  regression  along  with  a  new  analytic  model  for  near-field 
emission  by  a  nanoparticle  on  a  surface.  A  short-term  goal  of  the  research  was  to 
establish  a  quantitative  framework  for  characterizing  nanoparticles  nonintrusively  and  in 
real  time  on  properties  that  could  influence  the  near-field  thermal  spectrum  in  a 
nanoscale-gap  thermophotovoltaic  (nano-TPV)  apparatus.  A  long-term  goal  was  (and  still 
is)  to  engineer  nanostructured  surfaces  selectively  emitting  thermal  radiation  in  the  near 
field  for  efficient  nano-TPV  power  generation. 

The  short-term  goal  was  pursued  through  three  Objectives.  The  first  Objective  was  to 
develop  theory  and  methods  for  compound  estimation  of  mean  response  functions  and 
their  derivatives  with  multiple  predictor  variables  and  multiple  (possibly  correlated) 
outcome  variables.  The  second  Objective  was  to  develop  an  analytic  model  for  near-field 
thermal  emission  by  a  nanoparticle  (modeled  as  an  electric  point  dipole)  on  a  surface  to 
describe  how  nanoparticle  properties  (e.g.,  size  and  refractive  index)  affected  the  near¬ 
field  thermal  spectrum.  The  third  Objective  entailed;  (i)  predicting  the  outcomes  of 
laboratory  experiments  in  which  particles  on  a  surface  scattered  evanescent  waves 
generated  by  total  internal  reflection  of  an  external  radiation  beam;  and,  (ii)  assessing  the 
potential  utility  of  a  nanoparticle  characterization  framework  based  on  the  compound 
estimation  of  evanescent  wave  scattering  patterns  and  their  derivatives. 
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Summary  of  the  most  important  results 


Our  accomplishments  are  broadly  summarized  in  the  next  three  paragraphs, 
eorresponding  to  the  three  Objeetives  from  our  proposal,  and  then  deseribed  in  further 
detail  in  the  seetion  below. 

For  Objeetive  1,  the  nonparametrie  regression  teehnique  of  eompound  estimation  was 
extended  from  the  ease  of  a  single  outeome  variable  and  a  single  predietor  variable  [1]  to 
seenarios  with  multiple  predietor  variables  [2]  and  multiple  outeome  variables  [3,4],  The 
former  extension  also  allowed  for  some  of  the  predietor  variables  to  be  handled 
nonparametrieahy  while  others  were  handled  parametrieahy,  an  important  eonsideration 
for  problems  in  whieh  the  number  of  predietor  variables  is  large  and  the  eurse  of 
dimensionality  is  a  eoneern  [5].  The  latter  extension  is  remarkable  in  that  nonlinear 
shrinkage  a  la  Stein’s  phenomenon  [6,7,8]  ean  be  exploited  to  reduee  estimation  error 
without  saerifieing  the  self-eonsisteney  property  that  the  estimated  derivatives  of  the 
mean  response  funetion  are  in  faet  the  derivatives  of  the  estimated  mean  response 
function  -  a  property  not  enjoyed  by  some  widely  aeeepted  nonparametrie  regression 
teehniques  [9,10].  Besides  the  preeeding,  a  new  tuning  parameter  seleetion  eriterion 
eahed  multivariate  GCp  was  developed  to  aeeommodate  nonparametrie  regression  models 
with  multiple  predietor  variables  whose  realizations  are  irregularly  distributed  and  in 
whieh  the  goal  ineludes  reeovering  derivatives  of  the  mean  response  funetion  rather  than 
only  the  mean  response  funetion  itself  [11].  This  eriterion  overeomes  the  limitations  of 
numerous  existing  approaehes,  whieh  are  designed  to  work  with  only  a  single 
nonparametrie  regression  teehnique  [12,13],  ealibrated  to  reeovery  of  the  mean  response 
funetion  while  not  eonsidering  reeovery  of  its  derivatives  [14,15],  or  eonfined  to 
seenarios  with  a  single  predietor  variable  [16]. 

Regarding  Objeetive  2,  a  novel  method  for  solving  near- field  thermal  radiation  problems 
in  eomplex  three-dimensional  geometries  eahed  the  Thermal  Diserete  Dipole 
Approximation  (T-DDA)  has  been  developed,  implemented  and  tested.  The  basies  of  the 
T-DDA  and  the  validation  of  the  approaeh  have  been  published  reeently  [17].  The  T- 
DDA  has  been  expanded  to  aeeommodate  for  interaetions  between  arbitrarily-shaped 
objeets  of  any  size  and  a  (semi-infinite)  surfaee.  The  T-DDA  with  Surfaee  Interaetion  (T- 
DDA-SI)  has  been  verified  against  results  for  a  nanopartiele,  modeled  as  an  eleetrie  point 
dipole,  and  a  surfaee  [18].  We  are  eurrently  in  the  proeess  of  verifying  the  T-DDA-SI 
against  experimental  results  for  a  large  sphere  (i.e.,  not  modeled  as  an  eleetrie  point 
dipole)  and  a  surfaee  [19].  A  manuseript  on  the  T-DDA-SI  will  be  submitted  in  2013 
[20]. 
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For  Objective  3,  predictions  of  evanescent  wave  scattering  by  particles  on  a  surface  have 
been  performed  using  the  Discrete  Dipole  Approximation  with  Surface  Interaction 
(DDA-SI)  [21,22],  For  the  first  time,  the  DDA-SI  has  been  validated  using  experimental 
data  obtained  from  a  scaled  microwave  apparatus  [23,24],  Supplementary  results  from  a 
finite  element  method  (FEM)  have  been  used  for  additional  verifications,  A  good 
agreement  between  the  DDA-SI  and  experimental  data  was  found,  thus  suggesting  that 
the  DDA-SI  can  be  employed  as  the  forward  model  in  an  evanescent  wave-based 
characterization  framework.  We  now  look  forward  to  more  fully  developing  the  inverse 
model  (i,e,,  creating  and  employing  the  statistical  procedures  that  can  convert 
experimental  data  on  evanescent  wave  scattering  into  more  accurate  inferences  about  the 
particles’  characteristics  than  is  currently  possible).  Preliminary  investigations  using  the 
compound  estimation  technique  from  Objective  1  have  yielded  mixed  results  but  provide 
some  cause  for  optimism  [4],  Using  an  extensive  database  of  simulated  scattering  data, 
we  found  that  simultaneously  using  multiple  outcome  variables  (corresponding  to 
different  elements  of  the  Mueller  scattering  matrix  [25])  for  nanoparticle  characterization 
yielded  more  accurate  classifications  than  using  a  single  outcome  variable.  Moreover, 
nonlinear  shrinkage  a  la  Stein’s  phenomenon  [6,7,8]  reduced  the  error  in  estimating 
scattering  profiles  and  their  derivatives.  Yet,  the  reduced  estimation  error  did  not 
necessarily  translate  into  more  accurate  classifications  when  these  classifications  were 
based  on  a  principle  of  minimizing  distance.  Careful  analysis  of  the  results  does, 
however,  provide  insights  about  what  avenues  to  pursue  next. 
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Details  of  scientific  progress  and  accomplishments 


Objective  1 

Compound  estimation  was  originally  developed  by  Chamigo  and  Srinivasan  [1]  as  a 
nonparametric  regression  technique  for  estimating  the  mean  of  a  single  outcome  variable 
as  a  smooth  function  of  a  single  predictor  variable  and,  simultaneously,  estimating  the 
derivatives  of  the  same  function.  Compound  estimation  possessed  the  self-consistency 
property  that  the  derivatives  of  the  estimated  mean  response  function  were  exactly  equal 
to  the  estimates  of  the  derivatives.  This  property  is  not  shared  by  the  technique  of  local 
regression  [9,10],  which  has  been  heralded  for  providing  optimal  convergence  rates  in 
nonparametric  regression.  Yet,  Charnigo  and  Srinivasan  [1]  theoretically  proved  that 
compound  estimation  achieved  virtually  the  same  convergence  rates,  and  they 
empirically  demonstrated  that  compound  estimation  could  substantially  outperform  local 
regression  in  finite  samples.  Thus,  we  were  motivated  to  extend  compound  estimation  to 
scenarios  with  multiple  predictor  variables  [2]  and  multiple  outcome  variables  [3,4]. 

Such  scenarios  have  numerous  practical  applications.  In  addition  to  two  biostatistical 
applications  described  later,  another  application  of  particular  interest  to  our  group  is 
nanoparticle  characterization.  Here  the  multiple  predictor  variables  could  be  observation 
angle  and  radiation  wavelength,  while  the  multiple  outcome  variables  could  convey 
multi-dimensional  information  about  the  manner  in  which  nanoparticles  to  be 
characterized  scattered  evanescent  waves;  more  specifically,  the  multiple  outcome 
variables  could  correspond  to  distinct  elements  of  the  Mueller  scattering  matrix  [25].  The 
idea  is  that  nanoparticles  of  differing  configurations  should  have  their  own  characteristic 
mean  response  functions.  Thus,  upon  acquiring  scattering  data  from  nanoparticles  to  be 
characterized,  an  experimenter  could  estimate  the  corresponding  mean  response  functions 
and  then  “match”  them  (and  their  derivatives)  to  previously  tabulated  results  in  a 
“library”  of  mean  response  functions  (and  their  derivatives)  obtained  from  nanoparticles 
whose  configurations  were  known.  This  idea  was  previously  explored  by  our  group 
[35,36,37]  but  only  in  scenarios  with  a  single  predictor  variable  and  a  single  outcome 
variable  (or,  if  there  were  multiple  outcome  variables,  statistical  inference  for  each  one 
was  treated  in  isolation  from  statistical  inference  for  the  others). 

For  the  present  research,  then,  we  began  by  extending  compound  estimation  to  scenarios 
with  multiple  predictor  variables  [2].  As  with  a  single  predictor  variable,  differentiation 
and  estimation  were  interchangeable  in  that  the  estimated  derivatives  of  the  mean 
response  exactly  equaled  the  corresponding  derivatives  of  the  estimated  mean  response. 
To  illustrate  this  concept.  Figure  1  in  this  final  project  report  [2]  depicts  local  regression 
estimates  of  a  mean  response  and  its  first-order  partial  derivatives  in  proximity  to  a  local 
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minimum  of  the  mean  response,  while  Figure  2  [2]  depiets  eompound  estimates  of  the 
same.  The  estimates  in  Figure  1  eonfliet  with  eaeh  other,  while  those  in  Figure  2  are 
eoherent:  the  estimated  derivative  of  the  mean  response  is  the  derivative  of  the  estimated 
mean  response.  We  theoretieally  proved  that,  in  seenarios  with  multiple  predietor 
variables,  eompound  estimation  aehieved  essentially  optimal  eonvergenee  rates  for 
reeovering  the  mean  response  and  its  partial  derivatives.  Moreover,  as  displayed  in  Table 
1,  eompound  estimation  eould  reduee  (a  diserete  approximation  to  mean  integrated) 
squared  error  by  amounts  between  25%  and  80%  versus  loeal  regression. 

We  also  explained  how  eompound  estimation  eould  be  employed  even  when  some 
eovariates  were  modeled  parametrieally  and  distinet  observations  on  the  same  outeome 
variable  might  be  eorrelated  [2].  The  former  eapability  eould  be  helpful  for 
eireumventing  the  eurse  of  dimensionality  [5]  when  there  are  numerous  eovariates  but 
there  is  little  reason  to  suspeet  that  more  than  a  eouple  of  them  have  relationships  with 
the  outeome  variable  that  need  to  be  expressed  nonparametrieally.  The  latter  eapability 
eould  be  valuable  for  repeated  measures  data.  To  illustrate,  we  applied  eompound 
estimation  to  a  publiely  available  data  set  on  telemonitoring  in  Parkinson’s  disease  [38]. 
Taking  into  aeeount  the  possible  eorrelations  among  repeated  measures  on  the  same 
subjeet  through  random  effeets  in  a  semiparametrie  regression  model,  we  sought  to  relate 
the  progression  of  Parkinson’s  disease  (nonparametrieally)  to  a  subjeet’s  age  and  the 
signal  fraetal  sealing  exponent  of  the  subjeet’s  reeorded  voiee  while  adjusting 
(parametrieally)  for  a  number  of  other  eovariates.  We  diseovered  that,  espeeially  among 
those  intermediate  in  age,  an  abnormal  signal  fraetal  sealing  exponent  might  portend 
greater  symptom  progression.  In  addition,  using  eompound  estimation  for  inferenee  on 
the  nonparametrie  eomponent  of  a  semiparametrie  regression  model  might  even  reduee 
(versus  loeal  regression)  Type  II  hypothesis  testing  errors  on  the  parametrie  eomponent. 

We  next  extended  eompound  estimation  to  multiple  outeome  variables  [3,4].  An  initial 
insight  was  obtained  by  writing  out  a  loeal  likelihood  with  multiple  outeome  variables 
and  noting  that  the  estimates  for  the  mean  response  and  its  derivatives  were  invariant  to 
the  eorrelation  (or  laek  thereof)  between  outeome  variables.  As  sueh,  handling  the 
estimation  for  one  outeome  variable  at  a  time  is  perhaps  not  as  naive  as  one  may  first 
suspeet.  However,  while  the  eorrelation  between  outeome  variables  may  not  by  itself  eall 
for  a  different  estimation  approaeh,  the  multiplieity  of  outeome  variables  may.  We 
reealled  Stein’s  phenomenon  [6,7,8],  whieh  in  effeet  says  that  the  maximum  likelihood 
estimator  of  a  multivariate  normal  mean  veetor  ean  be  improved  upon  (in  terms  of 
expeeted  squared  error)  through  nonlinear  shrinkage.  In  partieular,  every  eomponent  of 
the  maximum  likelihood  estimator  should  eontribute  to  the  estimation  of  any  single 
eomponent  of  the  multivariate  normal  mean  veetor.  Startlingly,  this  eonelusion  holds 
even  when  the  eovarianee  matrix  is  diagonal. 
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Figure  1  (adapted  from  [2]):  Absence  of  self-consistency  from  local  regression.  The  left 
panel  depicts  a  close-up  near  a  local  minimum  of  a  mean  response  function  as  inferred  by 
local  regression.  The  middle  and  right  panels  portray  corresponding  yet  contradictory 
estimates  of  partial  derivatives.  In  particular,  the  local  minimum  (marked  by  a  +  symbol) 
does  not  occur  where  both  partial  derivative  estimates  are  zero  (marked  by  a  x  symbol). 
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Figure  2  (adapted  from  [2]):  Presence  of  self-consistency  in  compound  estimation.  The 
left  panel  depicts  a  close-up  near  a  local  minimum  of  a  mean  response  function  as 
inferred  by  compound  estimation.  The  middle  and  right  panels  portray  corresponding  and 
compatible  estimates  of  partial  derivatives.  In  particular,  the  local  minimum  occurs 
precisely  where  both  partial  derivative  estimates  are  zero  (marked  by  a  +  symbol). 


Target 

Loeal 
regression, 
seenario  1 

Compound 
estimation, 
seenario  1 

Loeal 
regression, 
seenario  2 

Compound 
estimation, 
seenario  2 

mean  response 

42 

31 

40 

29 

first  partial  xi 

2,415 

885 

2,219 

808 

first  partial  X2 

2,534 

934 

2,363 

819 

mixed  partial 

16,824 

12,236 

15,939 

11,988 

seeond  partial  xi 

169,689 

34,599 

139,858 

31,972 

seeond  partial  X2 

181,616 

44,768 

157,398 

33,621 

Table  1  (adapted  from  [2]):  Comparing  compound  estimation  to  local  regression.  Entries 
are  the  averages,  over  100  simulated  data  sets,  of  the  sums  of  squared  errors  in  estimating 
the  indieated  targets  via  loeal  regression  and  eompound  estimation.  Both  seenarios  have  n 
=  225,  errors  drawn  from  the  standard  normal  distribution,  and  eovariates  regularly 
distributed  on  the  unit  square.  Mean  response  funetion  for  seenario  1  is  oos(37i  xi  /  2)  + 
sin(37i  X2  /  2),  and  mean  response  funetion  for  seenario  2  is  arotan(2xi)  X2^  (6-x2^)  /  2. 


Thus,  we  investigated  how  to  ineorporate  Stein’s  phenomenon  into  nonparametrie 
regression  in  the  presenee  of  multiple  outeome  variables,  whieh  may  or  may  not  be 
eorrelated  [3,4].  The  primary  eomplieation  to  extending  the  original  results  of  Stein  and 
James  [6,7]  turns  out  to  be  the  bias  inherent  to  nonparametrie  regression  estimation. 
However,  if  one  is  willing  to  assume  (or  ean  estimate)  an  upper  bound  for  the  bias,  then  a 
similar  nonlinear  shrinkage  -  or  “Steinization”  -  of  nonparametrie  regression  estimates 
ean  also  be  shown  to  reduee  expeeted  squared  error  [3].  Importantly,  though  perhaps  not 
surprisingly,  the  magnitude  of  the  shrinkage  ealled  for  is  related  to  this  upper  bound.  In 
partieular,  large  bias  both  ealls  for  a  lesser  degree  of  shrinkage  and  portends  fewer 
benefits  from  shrinkage.  Thus,  Steinization  may  be  of  most  value  in  one  of  two 
situations:  (i)  tuning  parameters  are  ehosen  that  err  on  the  side  of  “undersmoothing”  the 
data;  or,  (ii)  foeus  is  on  estimating  the  derivatives  rather  than  (only)  the  mean  response, 
sinee  for  given  tuning  parameters  the  bias/varianee  tradeoff  tilts  more  toward  varianee  as 
one  differentiates.  Of  eourse,  sinee  our  researeh  agenda  is  partieularly  eoneerned  with 
derivative  estimation,  finding  (ii)  is  not  unweleome. 
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Figure  3  illustrates  Steinization  in  nonparametrie  regression  [4],  More  specifieally,  loeal 
regression  estimates  of  four  different  mean  response  functions  are  shown  at  each  of  27 
locations.  In  addition,  local  regression  estimates  of  the  derivatives  of  up  to  third  order  are 
acquired  but  are  not  depicted  in  Figure  3  to  avoid  making  the  graph  illegible.  Then 
Steinization  is  performed  on  a  16-dimensional  object  at  each  of  the  27  locations,  namely 
the  vector  whose  components  are  the  four  mean  response  functions  and  their  derivatives 
of  up  to  third  order  at  that  location.  One  could  similarly  employ  Steinization  with 
compound  estimation  instead  of  local  regression,  but  doing  so  would  destroy  the  self- 
consistency  property  that  motivated  compound  estimation  in  the  first  place.  Thus, 
exploiting  Stein’s  phenomenon  in  compound  estimation  is  more  delicate.  One  way  to  do 
so,  and  in  fact  what  is  actually  shown  in  Figure  3,  is  to  Steinize  the  pointwise  estimates 
(acquired  from  local  regression)  that  will  be  used  in  the  first  stage  of  compound 
estimation  [1].  Then  proceed  with  the  second  stage  of  compound  estimation,  namely  the 
convolution  of  local  polynomials  defined  using  the  pointwise  estimates,  in  the  manner 
previously  advocated  [1].  Since  self-consistency  is  wholly  a  by-product  of  the 
convolution  of  local  polynomials  and  has  nothing  to  do  with  the  properties  of  the 
pointwise  estimates,  self-consistency  is  not  jeopardized.  Yet,  if  the  pointwise  estimates 
are  improved  by  Steinization,  then  one  anticipates  that  the  resulting  compound  estimates 
will  likewise  be  improved. 

What  is  described  above  is  not  the  only  way  to  exploit  Stein’s  phenomenon  in  compound 
estimation.  For  instance,  instead  of  performing  Steinization  on  a  16-dimensional  object  at 
each  of  the  27  locations,  Steinization  could  be  performed  on  various  nine-dimensional 
objects.  One  such  object  would  be  the  estimates  of  the  first  mean  response  function  at  the 
first  nine  locations.  Taking  into  account  that  there  are  three  clusters  of  nine  locations, 
four  mean  response  functions,  and  derivatives  of  up  to  third  order,  one  sees  that  there  are 
48  nine-dimensional  objects  on  which  Steinization  could  be  performed.  We  refer  to  the 
former  approach  as  “Steinization  across  outcomes  and  orders  of  derivatives”  and  to  the 
latter  as  “Steinization  across  centering  points”.  Simulation  studies  suggest  that  the  latter 
approach  may  be  more  effective  in  reducing  squared  error  and,  not  surprisingly  given 
consideration  (ii),  that  the  benefits  of  Steinization  are  greater  for  estimating  derivatives 
than  for  estimating  mean  responses  [4];  see  Table  2. 
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Figure  3  (adapted  from  [4]):  Incorporating  Steinization  into  nonparametric  regression. 
Shown  in  black  are  pointwise  estimates  of  an  M34  seattering  profile  at  the  27  eentering 
points  used  in  compound  estimation,  whieh  are  eonverted  to  the  pointwise  estimates  in 
green  after  Steinization.  Shown  in  red  are  pointwise  estimates  of  Mii,  M12,  and  M33 
seattering  profiles.  Pointwise  estimates  in  green  represent  a  compromise  of  pointwise 
estimates  in  blaek  with  pointwise  estimates  in  red  (along  with  12  other  sets  of  pointwise 
estimates  not  displayed  to  avoid  making  the  graph  illegible),  which  is  most  easily  seen  at 
the  top  of  the  graph. 
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Squared  error 
of  estimation 

for: 

Seenario  1 

w/  and  w/o 

Steinization 

Seenario  2 

w/  and  w/o 

Steinization 

Seenario  3 

w/  and  w/o 

Steinization 

Seenario  4 

w/  and  w/o 

Steinization 

mean 

responses 

0.255;  0.273 
(7%  reduetion) 

0.125;  0.126 
(1%  reduetion) 

0.269;  0.272 
(1%  reduetion) 

0.125;  0.125 
(0%  reduction) 

first 

derivatives 

0.151;  0.202 
(25%  reduetion) 

0.049;  0.056 
(13%  reduetion) 

0.200;  0.205 
(2%  reduetion) 

0.055;  0.055 
(0%  reduction) 

seeond 

derivatives 

0.135;  0.196 
(31%  reduetion) 

0.042;  0.051 
(18%  reduetion) 

0.196;  0.200 
(2%  reduetion) 

0.050;  0.050 
(0%  reduction) 

summation 

0.541;  0.671 
(19%  reduetion) 

0.216;  0.233 
(7%  reduetion) 

0.665;  0.677 
(2%  reduetion) 

0.229;  0.230 
(0%  reduction) 

Table  2  (adapted  from  [4]):  Effects  of  Steinization  on  estimation  error.  Entries  are 
averages,  over  600  data  sets  designed  to  mirnie  results  from  seattering  experiments 
intended  for  nanopartiele  eharaeterization  [37],  of  the  sum  of  squared  errors  in  estimating 
four  mean  response  funetions  (eorresponding  to  four  distinet  elements  of  the  Mueller 
seattering  matrix  [25]),  their  first  derivatives,  and  their  seeond  derivatives,  expressed  as 
fraetions  of  the  squared  Frobenius  norm  of  the  nonparametrie  regression  smoothing 
matrix  in  the  absenee  of  Steinization.  Scenarios  1  and  2  entailed  eompound  estimation 
with  Steinization  of  loeal  regression  pointwise  estimates  aeross  elusters  of  nine  eentering 
points,  while  Scenarios  3  and  4  entailed  eompound  estimation  with  Steinization  of  loeal 
regression  pointwise  estimates  aeross  four  outeome  variables  and  orders  of  derivatives. 
Seenarios  1  and  3  had  a  signal  to  noise  ratio  of  4,  while  Seenarios  2  and  4  had  a  signal  to 
noise  ratio  of  16.  The  shrinkage  faetors  used  in  Steinization  assumed  that  estimation  bias 
in  nonparametrie  regression  was  negligible  relative  to  estimation  varianee. 


Our  final  endeavor  for  Objeetive  1  was  to  develop  a  tuning  parameter  seleetion  eriterion 
for  nonparametrie  regression  problems  with  multiple  eovariates  whose  values  may  be 
irregularly  spaeed,  when  there  is  interest  in  estimating  derivatives  of  a  mean  response 
funetion  rather  than  (only)  the  mean  response  funetion  itself  [11].  Apart  from  an 
asymptotieally  negligible  remainder,  our  multivariate  generalized  Cp  (GCp)  has  expeeted 
value  equal  to  the  sum  of  squared  errors  of  a  fitted  derivative,  whieh  eannot  be  ealeulated 
in  praetiee  beeause  the  true  derivative  is  unknown.  Thus,  multivariate  GCp  is  a  data-based 
proxy  for  this  sum  of  squared  errors  whose  minimization  is  hoped  to  also  (approximately) 
minimize  the  sum  of  squared  errors. 

The  statistieal  motivation  for  developing  multivariate  GCp  is  that  existing  tuning 
parameter  seleetion  eriteria  have  (at  least)  one  of  the  following  limitations: 
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(i)  They  may  minimize  a  proxy  for  the  sum  of  squared  errors  of  the  fitted  mean 
response  function,  not  the  fitted  derivative,  potentially  leading  to  undersmoothed 
estimated  derivatives  [14,15]. 

(ii)  They  may  be  designed  to  work  with  a  single  nonparametric  regression 
method,  such  as  local  regression  [12]  or  spline  smoothing  [13],  without  being  applicable 
to  other  nonparametric  regression  methods. 

(iii)  They  may  only  accommodate  scenarios  with  one  predictor  variable  [16]. 

However,  multivariate  GCp  does  not  have  these  limitations.  In  fact,  multivariate  GCp  can 
be  used  with  any  nonparametric  regression  method  that  produces  estimates  linear  in  the 
outcome  variable  at  particular  values  of  the  tuning  parameters. 

Besides  the  potential  application  in  nanoparticle  characterization,  a  motivating 
biostatistical  application  that  we  pursued  was  to  relate  a  person’s  blood  level  of  bilirubin 
(one  measure  of  liver  function)  to  the  two  principal  components  defined  by  his  or  her 
levels  of  aspartate  aminotransferase  (AST)  and  alanine  aminotransferase  (ALT)  (two 
other  measures  of  liver  function,  which  may  be  easier  and  less  expensive  to  monitor 
routinely  [39]).  Using  publicly  available  data  [40],  we  compared  multivariate  GCp  to 
ordinary  Cp  and  generalized  cross  validation  [14,15]  (among  other  tuning  parameter 
selection  methods)  in  their  abilities  to  recover  good  tuning  parameters  for  compound 
estimation  [2]  of  the  aforementioned  relationship. 

Figure  4  displays  the  estimated  mean  responses,  while  Figure  5  displays  the  estimated 
partial  derivatives  with  respect  to  the  first  principal  component.  The  latter  graph  is 
particularly  revealing  because  negative  fitted  values  occur  when  ordinary  Cp  and 
generalized  cross  validation  select  tuning  parameters.  Moreover,  these  negative  fitted 
values  are  difficult  to  defend  because  they  occur  in  a  region  where  the  data  are  sparse.  In 
contrast,  no  such  artifacts  are  observed  when  multivariate  GCp  selects  tuning  parameters. 
And,  unlike  the  estimate  resulting  from  ordinary  Cp  and  generalized  cross  validation,  this 
estimate  does  not  appear  to  be  undersmoothed.  In  fact,  a  simulation  study  suggests  that 
multivariate  GCp  leads  to  more  accurate  estimation  of  first-order  derivatives  than 
ordinary  Cp  and  generalized  cross  validation,  at  least  when  the  signal  to  noise  ratio  is  not 
too  low;  see  Table  3.  When  the  signal  to  noise  ratio  is  very  low,  conventional  tuning 
parameter  selectors  may  fare  comparatively  well  because  their  tendencies  to  undersmooth 
are  negated  by  very  high  noise.  However,  in  such  cases,  one  may  question  the 
appropriateness  of  attempting  to  estimate  derivatives  at  all. 
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mean  response  (0.20,50) 


mean  response  (0.50,50) 


xl 

Figure  4  (adapted  from  [11]):  Tuning  parameter  selection  and  mean  response 
estimation.  Displayed  are  two  compound  estimates  of  a  mean  response  function.  For  the 
top  estimate,  tuning  parameters  (0.20  nearest  neighbor  fraction  and  50  convolution 
parameter)  were  selected  by  Cp  and  generalized  cross  validation.  For  the  bottom  estimate, 
tuning  parameters  (0.50  and  50)  were  selected  by  multivariate  GCp.  Superimposed  in 
small  *  symbols  are  the  predictor  variable  values. 


14 


x1  derivative  (0.20,50) 
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Figure  5  (adapted  from  [11]):  Tuning  parameter  selection  and  partial  derivative 
estimation.  Displayed  are  two  compound  estimates  of  a  first-order  partial  derivative  of  a 
mean  response  function.  For  the  top  estimate,  tuning  parameters  (0.20  nearest  neighbor 
fraction  and  50  convolution  parameter)  were  selected  by  Cp  and  generalized  cross 
validation.  For  the  bottom  estimate,  tuning  parameters  (0.50  and  50)  were  selected  by 
multivariate  GCp.  Superimposed  in  small  *  symbols  are  the  predictor  variable  values. 
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Scenario 

multivariate 
generalized  Cp 

ordinary  Cp 

generalized 
cross  validation 

1 

1.07 

1.08 

1.08 

2 

1.10 

1.14 

1.14 

3 

1.14 

1.10 

1.10 

4 

1.07 

1.13 

1.13 

5 

1.07 

1.08 

1.08 

6 

1.07 

1.05 

1.05 

Table  3  (adapted  from  [11]):  Comparing  multivariate  generalized  Cp  to  its  competitors. 
Each  entry  is  the  median  over  100  simulated  data  sets  of  the  ratio  of  the  squared  error  in 
estimating  first-order  derivatives  aehieved  with  the  tuning  parameter  seleetion  eriterion 
divided  by  the  smallest  possible  squared  error  that  eould  be  aehieved  by  an  oraele 
selecting  the  best  tuning  parameters.  The  mean  response  function  in  scenarios  1,  2,  and  3 
is  defined  to  be  the  graph  in  the  top  panel  of  Figure  4,  while  the  mean  response  funetion 
in  seenarios  4,  5,  and  6  is  defined  to  be  the  graph  in  the  bottom  panel.  For  scenarios  1  and 
4,  error  varianees  were  two-thirds  the  magnitude  estimated  from  the  original  data  set;  for 
seenarios  2  and  5,  equal  to  that  magnitude;  for  seenarios  3  and  6,  three-halves  that 
magnitude. 


Objective  2 

Edalatpour  and  Franeoeur  [17]  developed  a  novel  framework  for  solving  near- field 
thermal  radiation  problems  in  three-dimensional  arbitrary  geometries.  The  method  ealled 
the  T-DDA  does  not  make  any  assumptions  on  the  shape  and  the  size  of  the  objects  as 
well  as  on  their  separation  distanees.  In  the  T-DDA,  objeets  are  discretized  into  eubical 
sub-volumes  eonceptualized  as  eleetric  point  dipoles.  After  diseretization,  the  following 
system  of  equations  is  obtained: 

A-P  =  E,>,c  (1) 

where  A  is  the  3  A  by  3  A  interaetion  matrix  (A  is  the  number  of  sub-volumes),  Emc  is  the 
3A  eolumn  vector  containing  the  incident  field  and  P  is  the  3A  eolumn  vector  containing 
the  unknown  dipole  moments.  The  form  of  Eq.  (1)  is  the  same  as  the  system  of  equations 
employed  in  regular  DDA  methods  for  predieting  light  seattering  by  particles  [26]. 
However,  in  thermal  radiation  problems,  the  ineident  field  is  due  to  thermal  emission. 
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such  that  Ei„c  is  a  stochastic  vector.  Consequently,  the  unknown  dipole  moment  vector 
P  is  also  stochastic. 

The  first  moment  (mean)  of  the  incident  field  is  zero,  while  its  second  moment  is 
computed  via  the  fluctuation-dissipation  theorem  (FDT)  [27].  The  FDT  relates  thermal 
fluctuations  of  currents  inside  a  body  to  its  local  temperature.  The  quantity  of  interest  in 
radiation  heat  transfer  is  the  net  power  exchanged  that  is  proportional  to  the  trace  of  the 
dipole  auto-correlation  function.  This  implies  that  the  instantaneous  dipole  moment 
vector  P  does  not  need  to  be  calculated  directly  (only  the  auto-correlation  function  of  P 
is  needed).  This  fact  is  extremely  important,  as  it  shows  that  the  T-DDA  is  a  deterministic 
method  even  if  near-field  thermal  radiation  problems  are  fundamentally  stochastic. 

The  T-DDA  has  been  tested  against  the  analytical  solution  of  Narayanaswamy  and  Chen 
where  near-field  radiative  heat  transfer  between  two  large  spheres  is  calculated  [28]. 
More  specifically,  the  quantity  reported  is  the  thermal  radiative  conductance  defined  as 
follows: 


G 


CO 


=  lim 

<5r^o 


(2) 


where  is  the  net  heat  rate  between  the  objects  while  T  is  the  temperature.  Figure  6 

shows  the  conductance  for  two  spheres  made  of  silica  calculated  using  the  T-DDA  and 
the  analytical  solution  of  Narayanaswamy  and  Chen.  The  diameters  of  the  spheres  are 
fixed  at  0.5  pm  while  the  temperature  is  kept  at  400  K;  separation  distances  of  0.5  and 
0.2  pm  are  considered  in  panels  (a)  and  (b),  respectively. 


Frequency  [eV] 
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(a) 


Frequency  [eV] 

(b) 

Figure  6.  Spectral  conductance  between  two  silica  spheres  for  separation  gaps  of:  (a)  0.5 
|u.m,  and  (b)  0.2  |Lim  (the  numbers  in  parentheses  correspond  to  the  number  of  sub¬ 
volumes  in  each  sphere). 


It  can  be  seen  that  the  T-DDA  is  in  excellent  agreement  with  the  analytical  solution.  For 
552  sub-volumes  in  each  sphere,  the  relative  differences  between  the  total  conductance 
(i.e.,  integrated  over  all  frequencies)  calculated  from  the  T-DDA  and  the  analytical 
solution  are  0.35%  and  6.4%  for  separation  gaps  of  0.5  and  0.2  qm,  respectively. 

The  analytical  solution  of  Narayanaswamy  and  Chen  [28]  is  based  on  a  complicated  and 
cumbersome  expansion  of  the  fields  in  vector  spherical  harmonics  (Mie-type  solution), 
and  is  thus  restricted  to  the  two-sphere  configuration.  For  instance,  the  case  of  three 
spheres  cannot  be  treated  with  the  solution  of  Ref  [28],  while  it  is  straightforward  to  add 
objects  in  the  T-DDA.  Additionally,  the  T-DDA  is  versatile  as  any  type  of  geometries  can 
be  considered.  As  an  illustrative  example,  the  conductance  between  two  cubes  of  silica 
calculated  with  the  T-DDA  is  shown  in  Figure  7.  Both  cubes  have  side  length  of  0.5  pm 
and  are  separated  by  a  distance  of  0.5  pm;  as  before,  the  temperature  is  fixed  at  400  K. 
Note  that  an  analytical  solution  does  not  exist  for  this  problem. 


18 


Frequency  [eV] 

Figure  7.  Spectral  conductance  between  two  silica  cubes  separated  by  a  0.5  |u,m-thiek  gap 
(the  numbers  in  parentheses  correspond  to  the  number  of  sub-volumes  in  each  cube). 


A  fast  convergence  of  the  T-DDA  is  observed  in  Figure  7.  The  relative  difference 
between  the  total  conductance  for  64  and  216  sub-volumes  in  each  cube  is  1.5%,  while 
the  relative  difference  is  0.6%  between  216  and  512  sub-volumes  (result  not  shown). 

A  further  development  of  the  T-DDA  is  the  addition  of  a  semi-infinite  surface  (method 
called  T-DDA-SI).  No  such  model  currently  exists  in  the  literature.  This  allows  for  the 
very  first  time  to  account  for  the  interactions  between  objects  of  arbitrary  shapes  and 
sizes  and  a  surface.  The  T-DDA-SI  will  impact  not  only  nano-TPV  power  generation,  but 
also  applications  such  as  thermal  microscopy  [29]. 

The  presence  of  the  surface  modifies  the  original  T-DDA  formulation  in  two  ways 
[30,31].  First,  the  incident  should  account  for  waves  emitted  by  dipoles  that  reflect  off  the 
surface  before  reaching  another  dipole.  Secondly,  the  interaction  mechanism  between 
dipoles  i  and  j  is  modified  by  adding  an  extraneous  contribution  due  to  radiation  from 
dipole  j  that  reflects  off  the  surface  before  reaching  dipole  i.  In  the  presence  of  a  surface, 
the  linear  system  of  equations  to  be  solved  becomes: 

(A  +  Rj-P  =  E;„c  (3) 


where  R  is  the  deterministic  reflection-interaction  matrix.  The  integration  of  surface 
interactions  within  the  T-DDA  is  essentially  based  on  the  work  of  Sommerfeld  that 
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analyzed  electric  dipole  radiation  above  a  semi-infinite  plane  [32],  The  electric  field 
reflected  off  the  surface  is  calculated  using  the  Sommerfeld  identity  [33]; 
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ik„r 


47rr 


4^  0  Ko 


(4) 


where  and  kzo  are  respectively  the  wavevectors  parallel  and  perpendicular  to  the 

surface,  while  Jo  is  the  Bessel  function  of  the  zeroth  order.  The  purpose  of  the 
Sommerfeld  identity  is  to  decompose  spherical  waves  into  a  summation  of  cylindrical 
waves  propagating  parallel  to  the  surface  and  plane  waves  propagating  along  the  z- 
direction  normal  to  the  surface.  As  such,  the  cylindrical  waves  do  not  interact  with  the 
surface;  only  the  plane  waves  interact  with  the  surface,  and  this  interaction  is  calculated 
via  the  standard  Fresnel  reflection  coefficients. 

By  performing  this  analysis  for  the  horizontal  electric  dipole  and  the  vertical  electric 
dipole  oscillations  (relative  to  the  surface),  the  electric  fields  reflected  off  the  surface  in 
both  TM  and  TE  polarizations  due  to  dipole  radiation  are  calculated.  These  reflected 

electric  fields  are  then  added  to  the  incident  field  vector  Emc .  The  reflected  electric  fields 
are  also  employed  for  populating  the  reflection-interaction  matrix.  This  is  done  by 
considering  the  reflected  field  at  r/  due  to  dipole  radiation  at  rj: 

(5) 

Using  the  reflected  electric  fields  in  combination  with  Eq.  (5),  the  expression  for  the  local 
reflection-interaction  matrix  R,y  for  all  dipole  pairs  is  derived.  Then,  the  global 

reflection-interaction  matrix  R  can  be  populated.  The  mathematical  details  are  omitted 
here  due  to  lengthy  equations.  With  the  T-DDA-SI,  near-field  thermal  radiation  problems 
in  three-dimensional  complex  geometries  involving  objects  near  or  on  a  surface  are 
tractable  for  the  very  first  time.  It  is  worth  noting  that  the  extension  to  multiple  surfaces 
is  straightforward  via  calculation  of  plane  wave  reflection  and  transmission  from  multiple 
plane  layers  [31]. 

Eigure  8  shows  a  proof-of-concept  of  the  T-DDA-SI,  where  the  net  heat  transfer  between 
a  nanoparticle  at  300  K  (radius  of  1  pm;  modeled  as  a  single  electric  point  dipole)  and  a 
surface  at  321  K,  both  made  of  silica,  is  shown  as  a  function  of  the  gap.  Note  that  the  gap 
here  corresponds  to  the  distance  between  the  edge  of  the  particle  and  the  surface. 
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Figure  8.  Net  heat  transfer  (normalized  by  the  far  field  heat  transfer)  between  a  dipole 
and  a  surface:  Comparison  between  T-DDA-SI  and  Otey  and  Fan  [18]. 


Results  from  the  T-DDA-SI  are  in  perfect  agreement  with  those  from  Otey  and  Fan  [18] 
for  a  nanoparticle  exchanging  radiation  with  a  surface.  The  T-DDA-SI  is  currently  being 
validated  against  experimental  data  for  a  large  sphere  and  a  surface  [19].  These  results 
along  with  the  details  of  the  T-DDA-SI  will  be  discussed  in  a  paper  to  be  submitted  in 
2013  [20]. 


Objective  3 

The  evanescent  wave-based  characterization  framework  includes  three  essential 
components:  (1)  the  forward  model  for  predicting  evanescent  wave  scattering  by  particles 
on  a  surface,  (2)  an  experimental  device  for  measuring  scattering  profiles,  and  (3)  an 
inversion  algorithm  in  order  to  retrieve  the  nanoparticles’  characteristics  from  measured 
scattering  profiles.  Prediction  of  evanescent  wave  scattering  by  particles  on  a  surface  is 
accomplished  via  the  Discrete  Dipole  Approximation  with  Surface  Interaction  (DDA-SI) 
[21,22].  In  this  work,  significant  progresses  have  been  made  in  the  establishment  of  the 
DDA-SI  as  a  forward  model  for  the  characterization  framework. 

In  order  to  assess  the  accuracy  of  the  forward  model  in  the  characterization  framework, 
the  DDA-SI  has  been  validated  for  the  first  time  against  scaled  microwave  experimental 
results  [23,24].  Multiple  experiments  were  made  in  the  frequency  band  of  from  4  to  6 
GHz  (wavelengths  A.  of  from  5  to  7.5  cm)  for  a  variety  of  lossless  and  lossy  cubes  and 
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spheres  with  size  parameters  of  from  1/2.5  to  1/0.83.  Figures  9  to  11  show  seleeted 
validation  results,  where  it  is  clear  that  the  DDA-Sl  provides  accurate  predictions  of 
evanescent  wave  scattering  by  particles  on  a  surface.  Note  that  results  from  a  FEM  are 
also  included  for  supplementary  verifications. 


Figure  9.  TE-polarized  electric  field  scattered  in  the  far  zone  by  a  lossless  cube  (1/1.785 
side  length)  illuminated  by  an  evanescent  wave;  Comparison  between  experimental, 
DDA-Sl  and  FEM  results. 


Figure  10.  TE-polarized  electric  field  scattered  in  the  far  zone  by  a  lossless  sphere 
(1/1.923  diameter)  illuminated  by  an  evanescent  wave;  Comparison  between 
experimental,  DDA-Sl  and  EEM  results. 
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(b) 
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(c) 

Figure  11.  TE-polarized  electric  field  scattered  in  the  far  zone  by  an  absorbing  sphere 
(/I/O. 865  diameter)  illuminated  by  an  evanescent  wave:  (a)  DDA-SI.  (b)  FEM.  (c) 
Experimental  results. 


Additionally,  an  extensive  sensitivity  analysis  of  the  Mueller  matrix  elements  to  the 
particles’  properties  has  been  conducted  via  the  DDA-SI.  The  results  of  this  sensitivity 
analysis,  defining  the  best  elements  to  employ  in  order  to  characterize  a  given  parameter, 
are  available  in  the  MS  thesis  of  Short  [34]. 

The  preceding  suggests  that  the  DDA-SI  can  be  employed  as  the  forward  model  in  an 
evanescent  wave-based  characterization  framework.  Our  future  research  agenda  will 
include  development  of  the  corresponding  inverse  model.  Preliminary  investigations  have 
already  been  undertaken  to  assess  the  viability  of  compound  estimation  for  that  purpose. 

Recall  from  our  description  of  progress  for  Objective  1  that  compound  estimation  was 
adapted  to  multiple  outcome  variables  [4]  (corresponding  to,  for  example,  multiple 
elements  of  the  Mueller  scattering  matrix  [25])  by  incorporating  nonlinear  shrinkage  a  la 
Stein’s  phenomenon  [6,7,8].  So  far  we  have  considered  two  forms  of  Steinization  for 
compound  estimation,  one  in  which  local  regression  pointwise  estimates  in  clusters  of 
nine  points  are  shrunk  toward  each  other,  and  one  in  which  local  regression  pointwise 
estimates  in  groups  of  sixteen  (defined  by  four  different  outcome  variables  and  orders  of 
derivatives  under  consideration)  are  shrunk  toward  each  other.  As  previously  shown  in 
Table  2,  both  forms  of  Steinization  can  reduce  estimation  error  if  the  signal-to-noise  ratio 
is  not  too  high,  and  Steinization  across  centering  points  can  do  so  even  if  the  signal-to- 
noise  ratio  is  rather  high. 
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However,  as  Table  4  reveals,  redueed  estimation  error  does  not  neeessarily  translate  into 
more  aceurate  nanopartiele  eharaeterization.  Such  a  disparity  seems  to  occur  because, 
while  Steinization  tends  to  reduce  the  distance  between  the  scattering  profiles 
estimated  from  a  noisy  data  set  to  the  actual  scattering  profiles  for  nanoparticles  of  the 
same  configuration  (and  similarly  for  the  respective  derivatives),  Steinization  also  tends 
to  reduce  the  distance  between  the  estimated  scattering  profiles  and  the  actual 
scattering  profiles  for  nanoparticles  of  different  configurations  (and,  again,  similarly  for 
the  respective  derivatives). 


Correct 

classification 

rates  based  on 
minimization  of  1? 
distance  between: 

Scenario  1 

w/  and  w/o 

Steinization 

Scenario  2 

w/  and  w/o 

Steinization 

Scenario  3 

w/  and  w/o 

Steinization 

Scenario  4 

w/  and  w/o 

Steinization 

mean  responses 

537/600; 

541/600 

552/600; 

552/600 

545/600; 

546/600 

552/600; 

554/600 

first  derivatives 

448/600; 

440/600 

550/600; 

550/600 

453/600; 

453/600 

544/600; 

543/600 

second  derivatives 

276/600; 

263/600 

379/600; 

374/600 

253/600; 

253/600 

352/600; 

354/600 

summation 

543/600; 

547/600 

561/600; 

563/600 

551/600; 

551/600 

559/600; 

559/600 

Table  4  (adapted  from  [4]):  Effects  of  Steinization  on  nanopartiele  characterization. 
Entries  represent  the  numbers  out  of  600  data  sets,  designed  to  mimic  results  from 
scattering  experiments  intended  for  nanopartiele  characterization  [37],  for  which  the 
“correct”  configuration  of  nanoparticles  was  guessed  based  on  the  principle  of 
minimizing  \f  distance.  For  example,  entries  in  the  “mean  responses”  rows  pertain  to 
minimizing  distance  between  the  scattering  profiles  estimated  from  a  particular  noisy 
data  set  and  the  actual  scattering  profiles  for  the  “correct”  configuration  and  three 
“incorrecf’  configurations  of  nanoparticles;  whichever  of  these  four  if  distances  is 
smallest  determines  the  guess  for  the  configuration  of  nanoparticles  from  which  the 
particular  noisy  data  set  was  generated.  Since  this  was  a  simulation  experiment,  we  knew 
which  guesses  were  correct  and  could  calculate  correct  classification  rates.  Scenarios  1 
through  4  are  as  described  in  the  caption  to  Table  2  above. 
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On  the  other  hand,  the  former  reduetions  tend  to  be  greater  than  the  latter  reduetions.  This 
suggests  that  if  we  make  our  classifieations  probabilistie,  by  eomparing  the  distanees 
and  assigning  indiees  of  plausibility  to  the  various  configurations,  instead  of  merely 
identifying  the  most  plausible  configuration  for  a  particular  noisy  data  set,  some  further 
benefits  of  Steinization  may  manifest.  For  example,  even  if  the  correct  classification  rate 
is  unchanged  from  559/600  after  Steinization  is  performed,  perhaps  the  experimenter  can 
go  from  being  “80%  confident”  in  the  classifications  to  “90%  confident”  after 
Steinization.  Of  course,  concepts  such  as  “80%  confidenf  ’  and  “90%  confident”  must  be 
formalized,  and  this  will  be  done  later  in  our  future  research,  but  the  point  for  now  is  that 
probabilistic  classifications  may  be  more  definitive  with  Steinization  than  without.  And, 
presumably,  an  experimenter  would  wish  to  have  greater  confidence  in  his  or  her 
inferences  about  nanoparticle  configurations  before  using  them  to  engineer 
nanostructured  surfaces  for  efficient  nano-TPV  power  generation.  Thus,  despite  the 
mixed  results  in  Tables  2  and  4  above,  we  have  some  viable  avenues  to  pursue  in  our 
future  research  regarding  the  use  of  compound  estimation  in  an  inverse  model  for 
nanoparticle  characterization  geared  toward  efficient  nano-TPV  power  generation.  We 
eagerly  look  forward  to  pursuing  these  avenues,  and  we  thank  the  ARO  for  its 
sponsorship  of  the  phase  of  our  work  just  completed. 
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nanoparticle  characterization,”  Journal  of  the  Optical  Society  of  America  Series  A,  to  be 
submitted  in  2014. 

S.  Edalatpour  and  M.  Francoeur,  “Near- field  radiative  heat  transfer  between  complex¬ 
shaped  objects  and  a  surface,”  Physical  Review  Letters,  to  be  submitted  later  in  2013. 

L.  Feng,  R.  Charnigo,  and  C.  Srinivasan.  “Stein’s  phenomenon  in  nonparametric 
regression,”  Journal  of  the  Royal  Statistical  Society  Series  B,  to  be  submitted  in  2014. 

L.  Feng,  R.  Charnigo,  and  C.  Srinivasan.  “James-Stein  type  compound  estimation  of 
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R.  Vaillon,  B.  Lacroix,  J.-M.  Geffrin,  M.  Francoeur,  and  R.  Abdeddaim,  “Far-field  and 
near-field  surface  wave  seattering  mierowave  seanner,”  Review  of  Scientific  Instruments, 
to  be  submitted  later  in  2013. 

(e)  Books:  None. 

(f)  Honors  and  Awards:  No  honors  or  awards  were  made  speeifieally  for  any  of  the 
papers  written  under  this  ARO  sponsorship.  However,  Dr.  Charnigo  was  promoted  from 
assoeiate  professor  (with  tenure)  to  full  professor  (with  tenure)  at  the  University  of 
Kentueky,  effeetive  July  1,  2013,  in  part  due  to  internal  and  external  reeognition  of  this 
ARO  sponsorship.  Moreover,  Dr.  Franeoeur  reeeived  the  NSF  Career  Award  in  July  2013 
for  demonstrating  experimentally  enhaneed  power  generation  in  nanoseale-gap 
thermophotovoltaie  power  generators;  he  believes  that  this  ARO  sponsorship  eontributed 
to  his  aequisition  of  the  NSF  Award. 

(g)  Title  of  patents  diselosed  during  the  reporting  period:  None. 

(h)  Patents  disclosed  during  the  reporting  period:  None. 
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Appendix  2:  Students  and  personnel  supported 


(a)  Graduate  students:  Three  graduate  students  partially  supported. 

Sheila  Edalatpour,  PhD  eandidate,  Department  of  Meehanieal  Engineering,  Elniversity  of 
Utah  (2011  -  Present).  Einancial  Support:  30%  PTE  for  one  semester  (thus, 
approximately  1 1%  of  one  year’s  effort). 

Eimin  Peng,  PhD,  Department  of  Statistics,  University  of  Kentucky  (2008  -  2013). 
Einancial  Support:  100%  PTE  for  a  four-week  interval  between  semesters  (thus, 
approximately  8%  of  one  year’s  effort).  However,  her  total  participation  in  ARO- 
sponsored  research  exceeded  this  amount  of  time  because  she  also  pursued  the  research 
in  her  dissertation,  in  partial  fulfillment  of  the  academic  requirements  for  the  PhD. 

Mitchell  R.  Short,  MS,  Department  of  Mechanical  Engineering,  University  of  Utah  (2011 
-  2013).  Einancial  Support:  100%  PTE  for  one  semester  (thus,  approximately  38%  of  one 
year’s  effort). 

(b)  Post  doctorates:  None. 

(c)  Paculty 

Richard  Chamigo,  Professor,  Departments  of  Statistics  and  Biostatistics,  University  of 
Kentucky.  Support:  1.47  months’  salary  (thus,  approximately  12%  of  one  year’s  effort). 

Mathieu  Prancoeur,  Assistant  Professor,  Department  of  Mechanical  Engineering, 
University  of  Utah.  Support:  0.50  months’  salary  (thus,  approximately  4%  of  one  year’s 
effort). 

Cidambi  Srinivasan,  Professor,  Department  of  Statistics,  University  of  Kentucky. 
Support:  1.47  months’  salary  (thus,  approximately  12%  of  one  year’s  effort). 

(d)  Undergraduate  students:  None. 

(e)  Graduating  undergraduate  metrics:  Not  applicable. 

(f)  Master  degrees  awarded:  One. 

Mitchell  R.  Short,  MS,  Department  of  Mechanical  Engineering,  University  of  Utah, 
2011-2013.  Thesis  title:  The  discrete  dipole  approximation  with  surface  interaction  for 
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evanescent  wave-based  characterization  of  nanostructures  on  a  surface  with  validation 
against  experimental  results. 

(g)  PhDs  awarded:  One. 

Limin  Feng,  PhD,  Department  of  Statistics,  University  of  Kentucky,  2008-2013. 
Dissertation  title:  James-Stein  type  compound  estimation  of  multiple  mean  response 
functions  and  their  derivatives. 

(h)  Other  research  staff:  None. 
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Appendix  3:  Technology  transfer 


There  are  no  inventions  or  patents  associated  with  this  ARO  sponsorship.  However,  the 
statistical  methods  proposed  in  “Nonparametric  and  semiparametric  compound 
estimation  in  multiple  covariates”  and  “A  multivariate  generalized  Cp  and  surface 
estimation”  have  been  implemented  in  the  R  computing  environment.  These 
implementations  will  be  made  freely  available  to  anyone,  upon  request  to  Dr.  Charnigo, 
once  the  corresponding  manuscripts  have  been  published  in  peer-review  journals. 
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Appendix  4:  Copies  of  technical  reports 


The  manuscript  “Stein’s  phenomenon  and  nanoparticle  characterization”  already  exists  in 
technical  report  form  and  has  been  uploaded  to  the  ARO  reporting  website  along  with 
this  final  project  report.  The  manuscript  will  be  improved  and  submitted  to  the  Journal 
of  the  Optical  Society  of  America  Series  A,  or  a  similar  peer-review  journal,  in  2014.  We 
have  also  uploaded  manuscripts  that  have  already  been  submitted  to  but  have  not  yet  been 
accepted  by  peer-review  journals,  as  well  as  conference  or  presentation  abstracts,  the  MS 
thesis  of  Mitchell  Short,  and  the  PhD  dissertation  of  Limin  Feng. 
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